PHOG: Probabilistic Model for Code
نویسندگان
چکیده
We introduce a new generative model for code called probabilistic higher order grammar (PHOG). PHOG generalizes probabilistic context free grammars (PCFGs) by allowing conditioning of a production rule beyond the parent non-terminal, thus capturing rich contexts relevant to programs. Even though PHOG is more powerful than a PCFG, it can be learned from data just as efficiently. We trained a PHOG model on a large JavaScript code corpus and show that it is more precise than existing models, while similarly fast. As a result, PHOG can immediately benefit existing programming tools based on probabilistic models of code.
منابع مشابه
Localization of the Lumbar Discs Using Machine Learning and Exact Probabilistic Inference
We propose a novel fully automatic approach to localize the lumbar intervertebral discs in MR images with PHOG based SVM and a probabilistic graphical model. At the local level, our method assigns a score to each pixel in target image that indicates whether it is a disc center or not. At the global level, we define a chain-like graphical model that represents the lumbar intervertebral discs and...
متن کاملRecognition of Handwritten Mathematical Symbols with PHOG Features
Converting handwritten formulas to LaTex is a challenging machine learning problem. An essential step in the recognition of mathematical formulas is the symbol recognition. In this paper we show that pyramids of oriented gradients (PHOG) are effective features for recognizing mathematical symbols. Our best results are obtained using PHOG features along with a one-againstone SVM classifier. We t...
متن کاملBerkeley PHOG: PhyloFacts orthology group prediction web server
Ortholog detection is essential in functional annotation of genomes, with applications to phylogenetic tree construction, prediction of protein-protein interaction and other bioinformatics tasks. We present here the PHOG web server employing a novel algorithm to identify orthologs based on phylogenetic analysis. Results on a benchmark dataset from the TreeFam-A manually curated orthology databa...
متن کاملNew Robust Descriptor for Image Matching
Nowadays, object recognition based on feature extraction is widely used in image matching due to its robustness to different types of image transformations. This paper introduces a new approach for extracting invariant features from interest regions. This approach is inspired from the well known Scale Invariant Feature Transform (SIFT) interest points detector and aims to improve the computatio...
متن کاملA Comparison of Seismic Safety of Steel MRF Designed According to Different Editions of Iranian Seismic Code
In this paper, efforts are made to compare the safety of steel moment resistant frames designed according to different editions of the Iranian code of Practice for seismic resistant design of buildings. Also, failure risk of a low and medium height frame which designed for high and low seismicity regions according to three editions of the code are evaluated. First, the testing cases were design...
متن کامل